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Asynchronous  techniques — that  is,  techniques  that  do  not  use  clocks  to 
implement  sequencing — are  currently  attracting  considerable  interest 
for  digital  VLSI  circuit  design  [3,  4,  6,  8,  1,  11],  particularly  when  the 
circuits  produced  are  delay-insensitive  (DI).  A  digital  circuit  is  DI  when 
its  correct  operation  is  independent  of  the  delays  in  operators  and  in 
the  wires  connecting  the  operators,  except  that  the  delays  are  finite 
and  positive. 

In  this  paper,  we  characterize  the  class  of  circuits  that  are  entirely 
DI,  and  we  show  that  this  class  is  surprisingly  limited:  Practically  all 
circuits  of  interest  fall  outside  the  class  since  closed  circuits  inside  the 
class  may  contain  only  C-elements  as  multiple-input  operators. 

The  paper  is  organized  as  follows:  First,  we  introduce  the  stable 
gate  model  of  DI  circuits,  which  is  based  on  the  notion  of  production 
rules  (PRs)  as  elementary  computation  steps.  We  then  define  a  partial 
ordering  on  transitions  in  the  circuits.  We  prove  that  all  DI  circuits 
have  to  fulfill  the  so-called  Unique-Successor-Set  criterion;  and  we  show 
that  the  class  of  circuits  that  meet  this  criterion  is  very  limited.  We 
also  give  a  characterization  of  the  class  of  computations  that  admit  a 
DI  implementation.  Finally,  we  discuss  what  we  consider  to  be  the 
weakest  compromise  to  delay-insensitivity,  namely,  isochronic  forks. 


1  Circuits  as  Networks  of  Gates 

A  DI  circuit  is  a  network  of  logical  operators,  or  gates.  A  gate  has 
one  or  more  Boolean  inputs  and  one  Boolean  output.  (Later,  we  will 
introduce  gates  with  multiple  outputs.)  The  state  of  the  circuit  is 
entirely  characterized  by  the  values  of  the  input  and  output  variables 
of  the  gates. 

We  assume  that  all  circuits  are  closed:  Each  variable  of  a  circuit  is 
the  input  of  a  gate  and  also  the  output  of  a  gate.  An  open  circuit  is 
transformed  into  a  closed  one  by  representing  the  environment  of  the 
circuit  as  gates. 
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1.1  Gates  as  Pairs  of  Production  Rules 

The  simple  assignments  x  true  and  x  :=  false  are  denoted  by  x  | 
and  x  1,  respectively.  An  execution  of  a  simple  assignment  is  called  a 
transition.  The  result  of  a  transition  of  type  x  f  is  the  postcondition  x 
(standing  for  the  predicate  “x  is  true”);  the  result  of  a  transition  of  type 
is  the  postcondition  -'X  (standing  for  the  predicate  “x  is  false”). 

A  gate  with  output  variable  ^  is  defined  by  the  two  production  rules: 

Bu  >  2T  T 
Bd  i ->  z  l 

where  Bu  is  the  condition  on  the  input  variables  for  a  transition  of  type 
z  |  to  take  place,  and  Bd  is  the  condition  on  the  input  variables  for  a 
transition  of  type  z  [  to  take  place.  Bu  and  Bd  are  called  the  guards 
of  the  PRs.  They  axe  Boolean  expressions  in  terms  of  the  Boolean 
variables  of  the  circuit. 

We  will  assume  that  a  guard  is  in  disjunctive-normal  form,  that  is, 
it  is  either  a  literal ,  a  term,  or  a  disjunction  of  terms.  A  literal  is  a 
variable  or  its  negation;  a  term  is  a  conjunction  of  literals. 

EXAMPLES:  The  gate,  with  inputs  x  and  y,  and  output  z,  defined 
by  the  two  PRs 

x  Ay  t-*-  z  l 
->x  V  ~>y  i — ►  2 1 

is  usually  called  a  “nand-gate.”  The  gate,  with  inputs  x  and  y,  and 
output  z,  defined  by  the  two  PRs 

x  Ay  >—>■  z | 

->x  A  -'y  i->  z  | 

is  usually  called  a  “Muller-C  element.”  □ 


1.2  Non-interference  and  Stability  of  PRs 

Obviously,  the  simultaneous  execution  of  both  PRs  of  a  gate  would 
result  in  a  malfunctioning  of  the  circuit.  Hence,  the  two  PRs  of  a  gate 
must  fulfill  the  non-interference  requirement. 

Non-interference.  ~<BU  V  -'Bd  is  invariantly  true. 

A  gate  is  a  partial  function  when  the  non-interference  requirement 
is  not  a  tautology  but  has  to  be  maintained  as  a  program  invariant. 
The  flip-flop  is  an  example  of  such  a  gate. 

The  non-interference  requirement  eliminates  the  most  obvious  case 
of  malfunctioning  of  a  gate.  But  other  forms  of  malfunctioning,  usually 


2 


called  hazards,  have  to  be  eliminated  as  well.  A  hazard  is  an  incomplete 
transition  on  the  output  of  a  gate  caused  either  by  two  consecutive 
transitions  on  one  input  variable  or  by  some  concurrent  changes  on 
several  input  variables.  In  our  model,  all  occurrences  of  hazards  are 
eliminated  as  follows. 

At  any  time,  all  PRs  of  a  circuit  with  a  true  guard  are  executed 
concurrently.  The  net  effect  of  the  execution  of  a  PR  is  to  establish 
the  result  of  the  PR;  therefore,  the  execution  of  a  PR  is  considered 
to  be  correctly  terminated  when  the  result  holds.  (The  result  of  a 
PR  is  the  result  of  the  transition  caused  by  an  execution  of  the  PR.) 
The  execution  of  a  PR  correctly  terminates  unless  the  guard  is  falsified 
before  the  result  holds.  In  that  case,  the  net  effect  of  the  execution  is 
undefined.  We  therefore  add  a  requirement  called  stability. 

Stability.  The  guard  of  a  PR  is  stable  in  a  computation  if  it  is  falsified 
only  in  states  where  the  result  of  the  PR  holds. 

(The  stability  of  the  physical  implementation  of  a  PR  also  requires 
that  the  changes  in  value  of  the  physical  quantity — voltage,  in  MOS 
technology — representing  the  Boolean  values  be  monotonic.  However, 
monotonicity  around  the  stable  values  is,  in  general,  neither  attainable, 
because  of  noise,  nor  necessary.) 

If  a  circuit  fulfills  the  non-interference  and  stability  criteria,  no  glitch 
or  hazard  can  corrupt  the  value  of  the  variables.  At  any  point  in  time, 
the  physical  quantity  representing  a  variable  either  has  one  of  the  two 
stable  values  representing  the  two  Boolean  values,  or  is  monotonically 
changing  from  one  stable  value  to  the  other. 

Observe  that,  whereas  non-interference  can  be  a  property  of  either 
the  gate — if  -> Bu  V  ~'Bjf  is  a  tautology — or  of  the  circuit  using  the  gate, 
the  stability  of  a  gate  is  always  a  property  of  the  circuit. 

Any  pair  of  PRs  that  set  and  reset  the  same  output  variable  defines 
a  valid  gate,  with  the  exception  of  self-invalidating  PRs.  A  rule  with 
guard  g  and  result  r  is  self-invalidating  if  r  =>  ~>g  may  hold  as  a  post¬ 
condition  of  a  transition  of  that  rule.  In  other  words,  the  execution  of 
the  rule  may  falsify  the  guard.  For  example,  the  rules  i  w  1}  and 
->x  x  f  are  self-invalidating. 

The  execution  of  a  PR  in  a  state  where  the  result  holds  is  called 
vacuous ;  otherwise,  it  is  called  effective.  From  the  definition  of  the 
execution  of  a  PR,  the  vacuous  execution  of  a  PR  is  equivalent  to  a 
skip.  Consequently,  it  is  always  possible  to  modify  the  guard  of  a  PR 
so  that  it  does  not  contain  the  output  variable  of  the  gate.  (This  is 
achieved  by  removing  all  terms  that  contain  the  result  as  literal.  For 
example,  (x  A  z)  V  y  zf  can  be  replaced  with  y  i— >  z  f,  since  an 
execution  of  the  PR  in  the  state  where  x  A  z  holds  is  vacuous.) 
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Hence,  gates  do  not  contain  variables  that  are  both  input  and  output 
(self-loops).  In  the  sequel,  unless  specified  otherwise,  an  execution  of  a 
PR  is  an  effective  execution. 

1.3  Wires 

A  priori ,  a  wire  with  input  x  and  output  y  is  the  gate  defined  by  the 
PRs  x  i— >  y  |  and  ->x  i— ►  y  j.  But,  since  the  composition  of  any  gate, 
including  a  wire,  with  a  wire  is  the  gate  itself  with  one  of  its  variables 
renamed,  we  can  add  an  arbitrary  number  of  wire  gates  to  a  circuit 
definition  without  actually  changing  the  circuit.  In  order  to  have  a 
unique  network  of  gates  for  each  circuit,  we  exclude  the  wire  from  the 
gates;  a  wire  is  just  a  renaming  mechanism  for  variables. 

So  far  all  gates  except  the  wire  have  more  inputs  than  outputs,  but 
most  circuits  have  as  many  outputs  as  inputs.  We  must  therefore  reset 
the  balance  by  introducing  at  least  one  gate  with  more  outputs  than 
inputs.  This  gate  is  the  fork. 

1.4  Forks  and  Multiple-Output  Gates 

A  fork  has  one  input  and  at  least  two  outputs.  The  fork,  /,  with  input 
x  and  outputs  y  and  z  is  defined  as 

x  t-+yt,*T 
“•*  !,  *  ! 

where  the  comma  means  the  execution  of  the  two  assignments  in  any 
order  or  concurrently.  The  generalization  to  an  arbitrary  number  of 
outputs  is  obvious.  The  gate 


Bu  wxj 
Bd  x  j 

composed  with  fork  /  is  equivalent  to  the  gate  with  outputs  y  and  2 

Bu  i-*  y  T »  *  T 
Bd  >-»  y  i,  z  i  . 

Hence,  the  fork  is  just  a  mechanism  for  replicating  the  outputs  of  a 
gate  and  for  defining  gates  with  an  arbitrary  number  of  outputs.  The 
following  discussion  is  somewhat  simplified  if  we  eliminate  the  fork  and 
allow  instead  the  type  of  multiple-output  gates  that  correspond  to  the 
composition  of  a  single-output  gate  and  a  fork.  But  gates  defined  in 
this  way  have  an  important  restriction:  The  effective  execution  of  a  PR 
of  a  gate  contains  an  effective  transition  on  each  output  of  the  gate. 
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1.5  Summary  of  the  Model 

The  only  restriction  that  these  definitions  and  conventions  introduce 
on  the  class  of  circuits  being  considered  is  the  exclusion  of  gates  with 
self-loops  and  of  arbitration  devices.  Unlike  models  based  on  the  “fun¬ 
damental  mode”  of  operation,  several  inputs  of  a  gate  may  change 
values  simultaneously  as  long  as  the  stability  of  the  guards  of  the  PRs 
is  preserved. 

Also,  we  do  not  assume  that  the  transitions  are  instantaneous:  A 
variable  value  changes  monotonically  from  the  “bottom”  value  repre¬ 
senting  one  logical  value  to  the  “top”  value  representing  the  other  log¬ 
ical  value,  and  vice-versa.  Because  the  transitions  durations  are  finite 
but  positive  and  variable,  the  ordering  of  transitions  in  a  circuit  has  to 
be  defined  with  care. 

2  Partial  Order  of  Transitions 

The  specification  of  a  sequential  circuit  defines  a  partial  order  of  actions 
taken  from  a  repertoire  of  commands.  In  order  to  assert  that  a  circuit 
fulfills  a  specification,  we  must  relate  this  partial  order  to  some  other 
order  relation  among  transitions  of  the  circuit.  The  partial  order  of 
transitions  is  defined  as  follows. 

Consider  an  effective  execution  of  a  PR  causing  the  transition  t.  and 
let  C  be  a  term  of  the  guard  such  that  C  holds  for  this  execution  of 
the  PR. 

We  attach  to  C  a  set,  T,  of  transitions  in  the  following  way.  Each 
literal  of  C  uniquely  defines  a  transition:  The  literal  x  is  the  result  of  a 
transition  of  type  x  |,  and  the  literal  ->x  is  the  result  of  a  transition  of 
type  x  j.  (The  initialization  of  a  variable  is  also  considered  a  transition.) 
By  definition,  we  say  that  transition  t  is  a  successor  of  each  transition  of 
T.  In  other  words,  a  transition  is  the  successor  of  the  set  of  transitions 
that  make  the  guard  true,  including  initializations. 

For  example,  if  the  PRisa:Ayt-+zt,  we  say  that  each  transition 
z  |  is  the  successor  of  a  transition  x  |  and  of  a  transition  y  f. 

If  the  guard  of  the  PR  is  of  the  form  A  V  B,  the  transition  is  the 
successor  of  the  set  of  transitions  that  make  A  true,  or  of  the  set  of 
transitions  that  make  B  true.  Hence,  the  successor  relation  defined  is 
not  unique  for  a  given  circuit.  A  computation  is  a  particular  successor 
relation  on  a  set  of  transitions,  such  that  each  computation  corresponds 
to  a  possible  execution  of  the  circuit.  The  set  of  transitions  of  a  compu¬ 
tation  is  finite  if  the  corresponding  execution  of  the  circuit  terminates, 
and  possibly  infinite  otherwise. 

From  the  successor  relation,  we  can  now  construct  a  relation  -<  that 
is  a  pre-order;  that  is,  it  is  transitive  and  anti-reflexive.  Once  we  have 
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the  pre-order  relation  we  construct  the  partial  order  X  by  defining 
£1  ■<  £2  to  mean  £1  t2  or  £1  =  t2. 

Transitivity.  For  any  two  transitions  tl  and  t2,  we  say  that  tl  -<  t2 
when  12  is  a  successor  of  tl,  or  there  exists  a  transition  £3  such  that 
£1  -<  £3  and  £3  -<  £2. 

Anti-reflexivity.  £  -<  £  holds  for  no  transition  t. 

REMARK:  Anti-reflexivity  is  satisfied  if,  for  each  ring  of  gates  in  the 
circuit,  there  is  always  at  least  one  PR  whose  guard  is  true  and  whose 
result  is  false — the  ring  “oscillates.”  Anti-reflexivity  excludes  rings  of 
gates  that  are  used  to  maintain  constant  values  of  variables,  as  in  cross- 
coupled  device  constructions  of  storage  elements.  We  therefore  assume 
that  the  storage  elements  are  parts  of  “perfect  wires,”  so  to  speak,  that 
keep  the  value  of  a  variable  until  the  next  transition  on  the  variable.  0 

Definition.  A  chain  from  a  to  b  is  a  finite,  non-empty  set  {£,-,  0  <  i  < 
n}  of  transitions  such  that  £0  =  a,  tn  =  b,  and  for  all  i,  0  <  i  <  n,  £,•  is 
a  successor  of  £,_i.  By  construction,  a  <b  means  that  there  is  a  chain 
from  a  to  b.  If  a  -<  b,  we  say  that  b  follows  a. 


3  Implementation  of  Stability 

Consider  again  an  execution  of  a  PR  with  guard  B  and  transition  £. 
Either  B  is  never  falsified  once  it  holds,  but  then  £  is  the  last  transition 
on  the  variable  involved,  and  we  say  that  the  transition  is  final.  Or  B 
is  falsified  after  a  finite  number  of  transitions  following  £,  in  which  case, 
in  order  to  implement  the  stability  of  B,  we  have  to  see  to  it  that  £  is 
completed  before  B  is  falsified. 

For  all  transitions  i  that  falsify  B ,  we  have  to  guarantee  £  -<  i. 
Hence,  by  definition  of  the  order  relation,  there  must  be  a  transition  s 
such  that  5  is  a  successor  of  £,  and  s  X  i.  We  say  that  s  acknowledges 
t.  Hence,  the 

Acknowledgment  Theorem.  In  a  DI  circuit,  each  non-final  transi¬ 
tion  has  a  successor  transition. 

By  construction  of  multiple-output  gates,  we  have  the 

Corollary.  In  a  DI  circuit,  a  non-final  transition  on  an  input  of  a  gate 
has  a  successor  transition  on  each  output  of  the  gate. 

EXAMPLE:  Consider  the  three  following  gates  with  two  inputs,  x 
and  y ,  and  one  output,  z.  The  flip-flop  is  defined  as  x  i— ►  z\  and 
~,y  z  i,  the  asymmetric  C-element  as  x  A  y  i-»  z  |  and  ~>y  i-+  z  j,  and 
the  switch  as  x  A  y  2  f  and  x  A  ->y  z  l. 
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Since  no  guard  of  these  gates  has  a  term  containing  the  literal  ~>x, 
a  transition  of  type  x  j.  has  no  successor.  Hence,  according  to  the 
Acknowledgment  Theorem,  there  can  be  at  most  two  transitions  on  x 
in  any  computation  of  a  DI  circuit  using  any  of  these  three  gates.  0 

4  The  Unique- Successor- Set  Criterion 

Later  on,  we  shall  give  a  simple  criterion  for  deciding  whether  a  given 
circuit — a  network  of  gates — is  DI.  But  such  a  criterion  does  not  tell 
us  whether  there  exists  a  DI  circuit  for  a  given  specification.  We  shall 
therefore  formulate  a  more  general  theorem  that  characterizes  the  par¬ 
tial  orders  of  transitions  that  admit  a  DI  implementation.  This  criterion 
enables  us  to  decide  that  a  program  has  no  DI  implementation  without 

having  to  construct  a  circuit. 

Successor  Set.  In  a  computation,  the  successor  set  of  a  transition  t 
is  the  set  of  variables  x  such  that  a  transition  on  x  is  a  successor  of  t. 

Unique-Successor-Set  Property.  A  computation  has  the  unique- 
successor-set  (USS)  property  when  all  non-final  transitions  on  the  same 
variable  have  the  same  successor  set.  A  set  of  computations  has  the 
USS  property  when  all  non-final  transitions  on  the  same  variable  have 
the  same  successor  set  in  all  computations  of  the  set. 

Unique-Successor-Set  Theorem.  A  set  of  computations  of  a  DI 
circuit  has  the  USS  property. 

Proof.  Consider  an  arbitrary  variable  x  of  a  DI  circuit.  By  the  corol¬ 
lary  of  the  Acknowledgment  Theorem,  any  non-final  transition  t  on  x 
has  a  successor  transition  on  each  output  of  the  gate,  say  G ,  of  which 
x  is  an  input. 

By  definition  of  the  successor  set,  the  set  of  output  variables  of  G 
is  the  successor  set  of  t.  But  since  the  set  of  output  variables  of  a 
gate  is  unique,  the  successor  set  is  the  same  for  all  non-final  transi¬ 
tions  on  x.  0 

5  Characterization  of  DI  computations 

Although  the  Unique-Successor-Set  Theorem  is  a  direct  consequence 
of  the  Acknowledgment  Theorem,  its  formulation  in  terms  of  compu¬ 
tations  instead  of  gates  makes  it  possible  to  lift  the  result  from  the 
implementation  level  to  the  specification  level.  Since  the  partial  orders 
of  actions  defining  a  circuit  are  projections  of  the  partial  orders  of  ac¬ 
tions  implementing  it,  we  shall  investigate  whether  the  USS  property 
is  maintained  by  projection. 
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Definition.  Given  a  computation,  c,  on  a  set  of  variables,  V,  the  pro¬ 
jection  of  c  on  a  subset,  W,  of  V  is  the  computation  derived  from  c  by 
removing  all  transitions  on  variables  ofV\W  from  the  chains  of  c.  The 
projection  of  a  set  of  computations  is  the  set  obtained  by  projecting 
each  element  of  the  original  set. 

Projection  Theorem.  If  a  set  of  computations  has  the  USS  property, 
then  its  projection  on  a  subset  of  variables  has  the  USS  property. 

Proof.  By  definition,  the  projection  of  a  set  of  computations  on  W 
can  be  obtained  by  removing  the  elements  of  V\W  one  for  one  from  all 
chains  of  each  computation  of  the  set.  We  prove  the  theorem  by  showing 
that  removing  all  transitions  on  one  variable,  say,  w,  maintains  the  USS 
property  of  the  set. 

Let  x  be  another  variable,  and  let  X  be  the  USS  of  (all  transitions 
on)  x  in  all  computations  of  the  set.  Either  w  does  not  belong  to  X  and 
X  is  left  unchanged  by  the  transformation,  or  w  is  removed  from  X. 
But  then,  for  each  transition  tx  on  x,  the  successor  set  of  the  transition 
on  w  that  follows  tx  must  be  added  to  the  successor  set  of  tx.  Since 
all  transitions  on  w  have  the  same  successor  set  in  all  computations  of 
the  set,  the  new  X  is  the  same  for  all  transitions  and  all  computations 
of  the  set.  0 

5.1  Example:  One-Place  Buffer 

The  cyclic  program  *  \X ;  Y] ,  where  X  and  Y  are  communication  com¬ 
mands,  is  called  a  one-place  buffer1.  It  is  a  basic  building  block  of  asyn¬ 
chronous  circuit  design  since  it  is  used  to  implement  the  sequencing  of 
any  two  actions.  With  a  four-phase  handshaking  protocol  for  imple¬ 
menting  the  communications,  an  expansion  of  the  program  in  terms  of 
elementary  variables  is: 

♦[[xzjjxoT;  [“>**];  xo  j;  yo  [y*];  yo|;  [->yi]], 

where  xi  and  yi  axe  the  input  variables,  and  xo  and  yo  are  the  output 
variables2.  (See  Figure  1.)  The  environment  of  the  circuit  can  be  simply 
modeled  as  the  two  programs: 

*[x«T;  [xo];  xi  j;  [->xo]] 

These  three  programs  are  concurrent.  Now  observe  that  the  projection 
of  a  computation  on  the  output  variables  of  the  first  program  gives  the 

'The  notation  *[S]  stands  for  the  non-terminating  repetition  of  the  program  S. 

For  an  arbitrary  Boolean  expression  B,  the  command  [B]  is  a  shorthand  notation  for 
[B  — >  sfcip],  and  can  be  informally  defined  as  “wait  until  B  holds.” 
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Figure  1:  A  one-place  buffer  and  its  interface 


computation  described  by  the  program 

^xo^xo  i^yo^yo  [}. 

Obviously,  this  computation  does  not  have  the  USS  property;  there¬ 
fore,  by  the  Projection  Theorem,  the  closed  circuit  implementing  the 
three  programs  is  not  DI.  But  the  two  environment  programs  can  be 
implemented  with  an  inverter  gate  and  an  identity  gate,  which  are  DI 
circuits.  Hence,  there  is  no  DI  circuit  implementing  this  version  of  the 
one-place  buffer  with  four-phase  handshaking. 

We  can  state  a  more  general  result.  We  observe  that,  for  what¬ 
ever  four-phase  handshaking  is  chosen  for  X  and  Y,  the  projection  on 
the  output  variables  is  always  xo  J.;  3/0  f;  yo  J.],  unless  the  hand¬ 

shaking  actions  of  X  are  reordered  (“shuffled”)  with  respect  to  the 
handshaking  actions  of  Y.  Hence,  the 

Theorem.  There  is  no  DI  circuit  implementing  a  one-place  buffer  with 
unshufHed  four-phase  handshaking. 

We  can  shuffle  the  handshaking  actions  of  X  with  respect  to  the 
handshaking  actions  of  Y ,  so  that  the  projection  on  the  output  variables 
is  the  sequence 

*\xo]\yo^xol\yoi]. 

Now,  the  sequence  has  the  USS  property,  and  we  can  implement  the 
one-place  buffer  as  a  DI  circuit.  An  example  is  shown  in  Figure  2. 
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Figure  2:  A  DI  circuit  for  the  one-place  buffer 


6  Specifications  and  the  USS  Property 

The  Projection  Theorem  is  very  useful  because  we  can  also  define  when 
a  specification  has  the  USS  property.  If  a  specification  does  not  have 
the  property,  we  can  immediately  conclude  that  there  exists  no  DI  im¬ 
plementation  of  the  specification.  The  projection  from  implementation 
to  specification  occurs  as  follows. 

We  assume  that,  whatever  specification  notation  is  used,  whether 
programs,  traces,  or  regular  expressions,  it  is  possible  to  derive  from  the 
specification  certain  properties  of  the  partial  order  of  actions  involved. 
Hence,  in  the  sequel,  a  specification  is  a  set  of  partial  orders  of  actions, 
where  an  action  is  an  execution  of  a  command  taken  from  some  given 
repertoire. 

We  also  assume  that  an  elementary  variable  can  be  uniquely  iden¬ 
tified  with  (the  implementation  of)  each  command:  The  transitions  on 
the  variable  occur  only  in  the  executions  of  the  command,  and  each  ex¬ 
ecution  of  the  command  contains  a  transition  on  the  variable.  This  (in 
theory,  slightly  restrictive)  assumption  is  needed  only  for  the  following 

Specification  Theorem.  If  the  specification  of  a  circuit  does  not  have 
the  USS  property,  the  circuit  is  not  DI. 

Proof.  Consider  a  specification,  S,  of  a  circuit.  For  each  command, 
X,  of  S,  we  substitute  a  transition  on  the  elementary  variable  x  that 
is  uniquely  associated  with  X.  We  obtain  a  set,  s,  of  partial  orders 
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of  transitions  on  elementary  variables.  Since  the  existence  of  the  USS 
property  is  independent  of  whether  the  transitions  are  upgoing  or  down¬ 
going  (that  is,  the  “direction”  of  the  transitions),  we  can  decide  whether 
s  has  the  USS  property  even  though  the  direction  of  the  transitions  in 
s  is  undefined. 

By  definition,  we  say  that  specification  S  has  the  USS  property  if 
and  only  if  the  set,  s,  thus  defined  has  the  USS  property.  By  construc¬ 
tion,  s  is  a  projection  of  the  set  of  computations  of  the  circuit  specified 
by  S.  Hence,  by  the  Projection  Theorem  and  the  USS  Theorem,  if  s 
does  not  have  the  USS  property,  the  circuit  is  not  DI.  □ 

EXAMPLES:  The  following  examples,  which  we  give  without  proofs, 
show  how  limited  is  the  class  of  programs  that  admit  a  DI  implemen¬ 
tation.  (In  the  examples,  all  commands  are  different  from  skip.)  We 
assume  that  the  semantics  of  the  program  notation  are  clear  enough 
that  we  can  identify  the  programs  with  the  partial  order  of  actions 
they  represent. 

•  Let  P  =  +[5i;  S2]  —  Sn],  and  assume  that  there  is  no  equivalent 
program 

*[S^S2-,...Sk] 

with  k  <  n.  (We  say  that  P  is  a  minimal  representation.  For  instance, 
* [X ;  X]  is  not  minimal  since  *[X]  is  an  equivalent  program.) 

Then  P  has  the  USS  property  if  and  only  if  Si  ^  Sj  for  i  ^  j. 
Hence,  the  “modulo-2  counter”  *[X;X;Y]  and  all  other  “modulo-k 
counters”  have  no  DI  implementation.  A  similar  result  has  been  proved 
by  C.  J.  Seger[9], 

•  The  program  *[5i;  [i?i  — +  S2\B2  — ►  S3];  £4],  with  S2  ±  S3,  does  not 
have  the  USS  property.  Hence,  there  is  no  DI  circuit  implementing  such 
a  selection  command.  □ 


7  Gate  Characterization  of  DI  Circuits 

We  have  already  seen  that,  apart  from  the  trivial  case  where  one  input 
of  the  gates  changes  at  most  twice,  there  is  no  DI  circuit  that  contains 
either  a  flip-flop,  or  an  asymmetric  C-element,  or  a  switch.  In  the  same 
way,  we  can  use  the  USS  and  the  Projection  Theorems  to  show  that 
there  is  no  DI  circuit  containing  either  an  or-gate,  or  an  and-gate,  or 
an  exclusive-or,  in  which  each  input  of  the  gates  changes  more  than  a 
minimum  number  of  times  specific  to  each  case.  Consider  an  or-gate 
with  inputs  x  and  y  and  output  z.  The  only  sequence3  in  which  each 

3  The  notation  (5)*  is  the  Kleene-star  notation  standing  for  an  arbitrary  number  of 
actions  S  in  sequence. 
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transition  on  an  input  is  acknowledged  is: 

((*  T;  z  T;  *  U  z  |)*;  (y  T;  z  T;  y  i;  z  |)T 

We  easily  see  that  any  computation  that  contains  a  transition  on  both 
inputs  does  not  have  the  USS  property. 

The  cases  of  the  and-gate  and  of  the  exclusive-or  are  treated  simi¬ 
larly  and  are  left  as  an  exercise  for  the  reader.  After  having  eliminated 
all  gates  with  at  most  two  inputs  except  the  inverter  and  the  Muller- 
C  element,  we  are  led  to  conjecture  that  a  DI  circuit  contains  only 
C-elements.  C-elements  are  defined  as  follows. 

Definition.  An  n-input  gate  in  which  Bu  is  the  conjunction  of  the 
n  input  variables  and  Bd  is  the  conjunction  of  the  negations  of  the  n 
input  variables  is  called  an  n-input  C-element.  A  gate  derived  from 
a  C-element  by  negating  one  or  more  literals  in  Bu  or  Bd  is  also  a 
C-element. 

The  Muller-C  element  is  a  two-input  C-element  according  to  our 
definition.  A  one-input  C-element  reduces  to  either  a  wire  or  an  in¬ 
verter. 

C-Element  Theorem.  If  a  DI  circuit  has  only  one  computation,  and 
if  the  computation  contains  at  least  three  transitions  on  each  variable, 
then  the  circuit  can  be  constructed  with  C-elements  only. 

Proof.  Let  x  be  an  arbitrary  variable  of  the  circuit;  x  is  the  input  of 
gate  g  with  output  z.  We  shall  prove  that  g  can  be  implemented  as  a 
C-element.  Since  there  are  no  self- loops,  x  and  z  are  different  variables. 

First,  observe  that  because  of  the  non-interference,  all  transitions 
on  the  same  variable  are  totally  ordered.  And  because  all  transitions 
are  effective,  upgoing  and  downgoing  transitions  on  the  same  variable 
alternate. 

Since  the  circuit  contains  at  least  three  (effective)  transitions  on  each 
variable,  at  least  one  transition  of  type  x  f  is  followed  by  a  transition 
of  type  x  f,  and  at  least  one  transition  of  type  x  j.  is  followed  by  a 
transition  of  type  x  j\ 

Let  fl  be  a  transition  of  type  x  |  and  t2  be  the  transition  of  type 
x  |  following  it.  For  the  guard  of  the  PR  of  /I  to  be  stable,  there  must 
be  a  transition  tz  on  2  such  that  fl  -<  tz  -<  t2.  We  also  know  that  tz  is 
a  successor  of  fl. 

By  the  USS  Theorem  and  the  Projection  Theorem,  there  is  exactly 
one  transition  tz  on  2  such  that  fl  -<  tz  ~<  1 2.  By  the  same  argument, 
there  is  exactly  one  transition  on  2  between  a  transition  of  type  x  j,  and 
the  transition  of  type  x  f  following  it. 

Without  loss  of  generality,  assume  that  the  first  transition  on  x  is  of 
type  x  X  and  the  first  transition  on  2  is  of  type  2  |.  Then,  because  of  the 
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alternation  of  upgoing  and  downgoing  transitions  on  each  variable,  each 
transition  of  type  z  "f  is  the  successor  of  a  transition  of  type  x  f,  and 
each  transition  of  type  z  J.  is  the  successor  of  a  transition  of  type  x  J,. 

By  definition  of  the  successor  relation,  x  holds  as  a  precondition  of 
each  transition  z\\  thus,  guard  Bu  of  g  can  be  formulated  so  that  all 
terms  contain  x,  since  a  term  that  is  never  true  can  be  removed.  Hence, 
Bu  can  be  chosen  of  the  form  x  A  Cu,  where  Cu  does  not  contain  x. 
Symmetrically,  guard  Bd  of  g  can  be  chosen  of  the  form  ~>x  A  Cd,  where 
Cd  does  not  contain  x.  Since  this  property  of  Bu  and  Bd  holds  for  each 
input  of  g,  g  is  a  C-element  or  can  be  replaced  with  a  C-element.  0 

8  Iso  chronic  Forks 

Since  the  class  of  DI  circuits  is  so  limited,  we  must  have  compromised 
the  delay-insensitivity  in  the  circuits  that  we  designed  using  the  syn¬ 
thesis  method  described,  for  instance,  in  [5]  and  [4].  Let  us  analyze  a 
standard  sequencing  circuit  used  in  this  design  style.  (It  is  similar  to 
the  one-place  buffer,  but  is  simpler  to  use  as  an  example.)  This  circuit 
(Figure  3)  is  an  implementation  of  the  sequence  of  elementary  actions: 

*[[**];  yof;  [if*3;«T;  N;W;  hy*];«o  T;  [->**'];«  I;  [-**];  xoi]. 


Figure  3:  A  sequencing  element  containing  isochronic  forks 

The  environment  of  the  circuit  is  the  same  as  that  of  the  one-place 
buffer.  The  x-  and  y- variables  are  each  parts  of  a  four-phase  hand¬ 
shaking  sequence,  and  u  is  a  state  variable — without  u,  it  would  not 
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be  possible  to  encode  each  state  of  the  circuit  uniquely.  Since  the  pro¬ 
jection  of  this  sequence  on  the  variables  xo,  yo ,  and  u  lacks  the  USS 
property,  and  since  the  environment  of  the  circuit  can  be  implemented 
as  an  inverter  and  an  identity,  the  circuit  is  not  DI. 

In  order  to  find  out  where  we  have  cheated,  we  must  look  at  the 
forks.  We  observe  that  xi  is  an  input  both  of  the  and-gate  with  output 
yo  and  of  the  C-element.  Hence,  the  circuit  actually  contains  a  fork 
with  input  xi  and  two  outputs,  say,  xl  and  x2.  Similarly,  the  circuit 
contains  a  fork  with  input  yi,  and  a  fork  with  input  u.  Let  us  analyze 
the  behavior  of  the  first  fork  by  introducing  it  explicitly  into  the  set  of 
PRs  of  the  circuit.  For  the  sake  of  simplicity,  we  ignore  the  other  two 
forks.  We  get: 


xi 

xl  T,  x2 1 

xl  A  ->u 

h-+  yo  f 

x2  A  yi 

i— ►  u  | 

-ixl  V  u 

yoi 

->yi  A  u 

H- >  xoj" 

->xi 

xl  !,x2| 

-*x2  A  -i yi 

»— >•  u  j 

yi  V  ->u 

i— *■  xo| 

Transitions  xl  |  and  x2  f  are  both  acknowledged  by  the  two  PRs 
that  follow.  But  only  transition  x2  f  is  acknowledged.  Transition  xl  j 
is  not  acknowledged.  Hence,  the  circuit  is  not  DI,  because  the  Ac¬ 
knowledgment  Theorem  is  not  satisfied.  Therefore,  the  completion  of 
transition  xl  |  is  not  guaranteed  unless  we  implement  the  fork  as  an 
isochronic  fork ,  which  is  defined  as  follows. 

In  an  isochronic  fork,  when  a  transition  on  one  output  is  acknowl¬ 
edged,  and  thus  completed,  the  transitions  on  all  outputs  are  acknowl¬ 
edged,  and  thus  completed. 

(We  leave  it  as  an  exercise  to  the  reader  to  check  that  the  fork  with 
input  yi  must  also  be  isochronic,  but  not  the  fork  with  input  it.) 

The  implementation  of  an  isochronic  fork  relies  on  two  types  of 
assumptions  about  delays.  First,  we  have  to  assume  that  the  difference 
between  the  delays  in  the  branches  of  the  fork  is  negligible  compared 
to  the  delays  in  the  gates.  This  requirement  is  easy  to  meet  in  current 
MOS  technology  except  when  there  is  an  inverter  on  one  branch  of 
the  fork  and  not  on  the  other  branch(es).  The  fork  with  input  yi  has 
such  an  inverter,  and  therefore,  the  inverter  must  be  removed  by  proper 
circuit  transformations. 

Second,  and  more  important  in  current  technology,  we  have  to  as¬ 
sume  that  the  switching  thresholds  in  the  different  gates  to  which  the 
fork  is  an  input  are  close  enough  to  each  other.  This  requirement  is 
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more  difficult  to  meet  than  the  first  one  because,  on  the  one  hand,  the 
thresholds  of  individual  transistors  are  difficult  to  control — in  particu¬ 
lar  in  CMOS;  on  the  other  hand,  the  switching  thresholds  of  a  gate  vary 
greatly  with  the  logical  design  of  the  gate.  For  these  reasons,  this  re¬ 
quirement  may  impose  a  design  style  in  which  all  gates  are  implemented 
as  combinational  gates,  so  that  the  fight  between  pull-up  and  pull-down 
during  the  switching  of  the  gate  keeps  the  switching  threshold  around 
VDD/2.  Observe  that,  unlike  what  is  advocated  in  other  compromises 
to  delay-insensitivity,  enforcing  the  locality  of  the  wires  offers  little  help 
in  implementing  isochronicity  because  locality  is  irrelevant  to  the  issue 
of  threshold  voltages! 


9  For  Whom  the  Bell  Tolls? 

Are  these  results  tolling  the  bell  for  DI  design?  Actually,  not.  At 
worst,  they  may  slightly  embarrass  those  researchers  who  claim  to  have 
a  design  method  for  entirely  DI  circuits.  At  best,  they  vindicate  the 
compromises  to  delay-insensitivity  adopted  by  several  asynchronous  de¬ 
sign  methods.  Most  likely,  they  are  sobering  reminders  of  the  difficulty 
of  VLSI  design  and  the  novelty  of  asynchronous  design. 

We  have  proved  elsewhere  that  extending  a  standard  repertoire  of  DI 
gates  with  isochronic  forks  is  sufficient  to  construct  any  circuit  of  inter¬ 
est.  The  proof  consists  in  giving  a  circuit  implementation  for  each  con¬ 
struct  of  the  progam  notation  we  use  (see  [2]).  I  believe  the  isochronic 
fork  to  be  the  weakest  possible  compromise  to  delay-insensitivity  in 
the  sense  that  all  other  compromises  also  include  isochronic  forks:  For 
instance,  in  speed-independent  design[7],  all  forks  are  supposed  to  be 
isochronic;  in  self-timed  design [10],  all  forks  inside  a  certain  region — 
called  an  equipotential  region — are  assumed  to  be  isochronic. 
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